INTELLIGENT INFORMATION SYSTEMS WITH LEARNING CAPABILITIES

Abstract 

An intelligent information system is designed to derive information (that 
may not be explicitly stored in the data base) by application of rules for 
inferring plausible answers to queries. The system is divided into the 
knowledge base (KB) and the inference engine (IE). The KB can be further 
partitioned into a factual base (FB) and an explanatory base (EB). The FB 
is used for storing facts (data) that may be imprecise or incomplete, and 
the EB contains knowledge; i.e., flexible (fuzzy) concepts, relationships, 
or rules that are used to interpret the available data. The IE is designed 
to perform flexible reasoning. Clearly, the "intelligence" of the system 
depends on the knowledge available in the KB and the types of inferences 
that the IE is capable of performing. An experimental system (APPLAUSE) is
discussed, together with a demonstration of how the system functions in the
knowledge acquisition and querying modes, including its explanatory capabilities.
THEME


• Make Machines Behave Intelligently 
Marks of intelligence: 

• Learning Capability 

• Reasoning with Insufficient, Unreliable or Imprecise Data 

• Reasoning Under Resource Constraints 


• Creativity - Discovery


PLAUSIBLE REASONING 


• A core theory, proposed by Collins and Michalski

• Modifications and Implementation 

MODEL 

• Hierarchical Organization of Knowledge 

• Mechanism to Manipulate Incomplete and Uncertain 
Knowledge Base 

• Domain Independent Inference Mechanism 

• Theory is Operationalized with the Chemical Periodic Table
as a Test Domain


KNOWLEDGE REPRESENTATION 


ELEMENTS OF EXPRESSIONS 


• Arguments 

• Descriptors 

• References 

• Terms 

• Facts - veracity $\mu$, frequency $\phi$, confidence $\gamma$

• Dependency - forward and backward dependencies

• Implication - forward and backward implications

• Hierarchies - generalization, specialization

• Similarity - context, dominance, typicality


ELEMENTS OF EXPRESSIONS 


Descriptor: $d$

    breed          attribute
    temperature    attribute, function
    flies          predicate


Term: $d_1(a_1)$, or $d_2(a_1, a_2, \ldots)$

    breed(Fido)
    temperature(latitude, altitude)
    temperature(place)


References: $r_1$, $\{r_1, \ldots\}$

    4         integer
    true      logical
    group8    hierarchical


Factual statements: $d_1(a_1) = r_1 : [\mu, \gamma_\mu, \phi, \gamma_\phi]$


• $\mu$ - veracity: indicates the degree to which reference
$r_1$ is applicable to the descriptor-argument pair.

• $\phi$ - frequency: indicates the proportion of the argument
for which the reference is a valid description of the
descriptor-argument pair.

• $\gamma_\mu$, $\gamma_\phi$ - confidences in $\mu$, $\phi$.


Examples: 

density (aluminum) = 2.7 : [1, .99, 1, 1] 

is-old(john) = yes : [.7, .9, 1, 1] 

engine_type(car) = 4_cylinder : [1, 1, .8, .95]
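
A factual statement of this form can be held in a small record. The sketch below is one possible encoding and is not taken from APPLAUSE; the class name `Fact` and its field names are assumptions introduced for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Fact:
    """One factual statement d(a) = r : [mu, gamma_mu, phi, gamma_phi]."""
    descriptor: str
    argument: str
    reference: object
    mu: float         # veracity
    gamma_mu: float   # confidence in mu
    phi: float        # frequency
    gamma_phi: float  # confidence in phi


# The three examples above, transcribed directly.
facts = [
    Fact("density", "aluminum", 2.7, 1, .99, 1, 1),
    Fact("is-old", "john", "yes", .7, .9, 1, 1),
    Fact("engine_type", "car", "4_cylinder", 1, 1, .8, .95),
]
```

Keeping the four parameters as named fields rather than a bare list makes it harder to confuse a veracity with its confidence when evidence is combined later.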

Dependency between terms:


$d_1(a_1) \longleftrightarrow d_2(a_2) : [\alpha, \gamma_\alpha, \beta, \gamma_\beta]$

• $\alpha$, $\beta$ - forward and backward dependency strengths

is-philosopher(X) <--> is-greek(X) : [.5, .8, .0001, .8]


Implications between factual statements:


$d_1(a_1) = r_1 \Longleftrightarrow d_2(a_2) = r_2 : [\alpha, \gamma_\alpha, \beta, \gamma_\beta]$

grain(place) = rice <=>
rain(place) = [80..120m] : [.9, .9, .5, .8]

The implications can also be encoded by functions:

$d_1(a_1) = r_1 \Longleftrightarrow d_2(a_2) = f(r_1)$

radius(circle) = r <=>
area(inscribed square) = $2r^2$ : [1, 1, 1, 1]


The transforms A GEN, A SPEC, A SIM, R GEN, R SPEC, and
R SIM allow traversal of a hierarchy in the process of inference.
Simplified (no parameters) applications of the transforms
are given below.




• A GEN

speed(computer_1) = slow
micro_computer = gen(computer_1): cx = alu_size
alu_size(COMPUTER) <--> speed(COMPUTER)
micro_computer = spec(COMPUTER)
speed(micro_computer) = slow

[Figure: A GEN Transformation]











• A SPEC

height(basketball_player) = tall
karim = spec(basketball_player)
height(karim) = tall


• A SIM

economy(singapore) = strong
hongkong = sim(singapore): cx = economic structure
economy(hongkong) = strong


• R GEN

reacts_with(potassium) = chlorine
group7 = gen(chlorine)
reacts_with(potassium) = group7


• R SPEC

likes(mary) = carbonated_drink
coke = spec(carbonated_drink)
likes(mary) = coke


• R SIM

habitat(whales) = atlantic_ocean
pacific_ocean = sim(atlantic_ocean)
habitat(whales) = pacific_ocean


• Combination of A SIM and R GEN

reacts_with(potassium) = chlorine
sodium = sim(potassium)              (A SIM)
reacts_with(sodium) = chlorine
group7 = gen(chlorine)               (R GEN)
reacts_with(sodium) = group7
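
The combined A SIM and R GEN step can be traced in a few lines. This is a minimal sketch with the parameters dropped, as in the simplified examples above; the dictionaries `facts`, `parent`, and `similar` and the function names `a_sim` and `r_gen` are hypothetical and chosen only for illustration.

```python
# Toy knowledge base; every name here is an illustrative assumption.
facts = {("reacts_with", "potassium"): "chlorine"}
parent = {"chlorine": "group7"}       # gen(): child -> parent
similar = {"potassium": {"sodium"}}   # sim(): argument -> similar arguments


def a_sim(descriptor, source, target):
    """A SIM: carry the reference known for `source` over to a similar argument."""
    if target not in similar.get(source, set()):
        raise ValueError(f"{target} is not known to be similar to {source}")
    return facts[(descriptor, source)]


def r_gen(reference):
    """R GEN: replace a reference by its parent in the hierarchy."""
    return parent[reference]


ref = a_sim("reacts_with", "potassium", "sodium")   # chlorine
print("reacts_with(sodium) =", r_gen(ref))          # group7
```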






Theory of Plausible Reasoning and its 
Implementation. 


Collins and Michalski introduced a theory to model human
plausible reasoning. APPLAUSE is an implementation of an
extended and modified version of the theory. The methodology
is well suited to reasoning in domains where knowledge is
organized hierarchically. The theory provides mechanisms to
manipulate the knowledge base when knowledge is incomplete
and uncertain. Some features of the theory are highlighted
with examples from the chemical periodic table.



QUERIES 


Form: 

descriptor(argument) = ref? / [$\mu$, $\gamma_\mu$, $\phi$, $\gamma_\phi$]?    (1)


Aim:

In query (1) the system is to retrieve the best reference value
together with the estimated parameters. The best reference
is the one with the highest $\mu \cdot \gamma_\mu \cdot \phi \cdot \gamma_\phi$ product.

• Type checking is performed for arguments, descriptors,
and references when applicable.
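
Selecting the best reference by the product of the four parameters can be sketched as follows. The function name `best_reference` and the candidate values are hypothetical; the numbers are invented purely to show the ranking.

```python
def best_reference(candidates):
    """Pick the reference with the highest mu * gamma_mu * phi * gamma_phi.

    `candidates` maps a reference value to its parameter tuple
    (mu, gamma_mu, phi, gamma_phi).
    """
    def score(params):
        mu, gamma_mu, phi, gamma_phi = params
        return mu * gamma_mu * phi * gamma_phi

    return max(candidates, key=lambda ref: score(candidates[ref]))


# Hypothetical competing answers to one query (values invented for illustration).
candidates = {
    "slow": (1, .9, 1, .9),    # product 0.81
    "fast": (.7, .9, 1, 1),    # product 0.63
}
best = best_reference(candidates)
```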


ALGORITHM for processing Queries: 

• get_query(Q)

• if get_fact(Q) is successful then

    - report the retrieved information and exit.

• elseif reasoning_depth_counter > depth_limit then

    - combine whatever evidence is available and exit.

• else

    - increment depth_counter by one.

    - Dep = set of dependencies/implications such that the
      descriptor occurs in the RHS and $\alpha \cdot \gamma_\alpha$ > threshold T.

    - sort dependencies and implications according to decreasing
      $\alpha \cdot \gamma_\alpha$ (gather strongest evidence first).

    - repeat until no more dependencies:

        * evaluate the LHS of the dependency or implication. If
          necessary, call this routine recursively to evaluate the LHS.

        * apply suitable transforms such as A GEN,
          A SPEC, A SIM and compute the RHS; decrement
          depth_counter by one and exit.

• combine evidences:

    - choose the best $\gamma_\mu$, or $\mu \cdot \gamma_\mu \cdot \phi \cdot \gamma_\phi$ product, for a type 1
      or type 2 query respectively.

    - compute the union and intersection of ranges to give upper
      and lower bounds on the range of the conclusion,
      respectively.
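
The control flow of the steps above can be sketched as a short recursive routine. This is a simplified illustration under stated assumptions, not the APPLAUSE implementation: each conclusion carries a single confidence value, the hierarchy transforms and range bounds are omitted, and evidence is combined by keeping the most confident conclusion. The names `Rule` and `answer` and the toy rule are hypothetical.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class Rule:
    lhs: str                       # descriptor on the left-hand side
    rhs: str                       # descriptor on the right-hand side
    fn: Callable[[float], float]   # maps an LHS reference to an RHS reference
    alpha: float                   # forward strength
    gamma_alpha: float             # confidence in alpha


def answer(descriptor, arg, facts, rules, depth=0, depth_limit=3, threshold=0.5):
    """Return (reference, confidence) for descriptor(arg), or None."""
    if (descriptor, arg) in facts:               # get_fact succeeded
        return facts[(descriptor, arg)]
    if depth > depth_limit:                      # out of reasoning depth
        return None
    usable = [r for r in rules
              if r.rhs == descriptor and r.alpha * r.gamma_alpha > threshold]
    usable.sort(key=lambda r: r.alpha * r.gamma_alpha, reverse=True)
    evidence = []
    for rule in usable:                          # strongest evidence first
        lhs = answer(rule.lhs, arg, facts, rules, depth + 1, depth_limit, threshold)
        if lhs is not None:
            ref, conf = lhs
            evidence.append((rule.fn(ref), conf * rule.alpha * rule.gamma_alpha))
    # Combine evidence: here, simply keep the most confident conclusion.
    return max(evidence, key=lambda e: e[1]) if evidence else None


facts = {("period", "Kr"): (4, 1.0)}
rules = [Rule("period", "boiling_point", lambda p: -317 + 41.8 * p, 0.9, 0.9)]
result = answer("boiling_point", "Kr", facts, rules)
```

Passing the depth as an argument instead of incrementing and decrementing a shared counter keeps the recursion free of global state; the effect on the depth limit is the same.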


EXAMPLE 


Given: 

• Group8a consists of gases [He, Ne, Ar, Kr, Xe, Rn].

• Boiling points of only 4 gases are known. 

[He/-269, Ne/-246, Ar/-185, Xe/-108]. 

Query: Find boiling point of Kr. 

Process: 


• Statistical analysis is performed to see whether it is reasonable
to aggregate the boiling points into a range and propagate
it to a parent node.

• Kr has two parents, group8a and period4. 

• Suitability of generalization is tested in both hierarchies. 


• The better one is selected for inference. 




Two possible hierarchies for periodic table 









Criteria for generalization: 


• Low standard deviation of residuals 

• Generalize from large number of points. 

• Low range of residual errors (fewer outliers) 

• Presence of functional dependencies with characteristics 
similar to those in the neighboring classes. 


• 'Causal connection' 



Total # of Points = 100 
Total # of Variables = 3 

Names of the attributes GROUP PERIOD ATNUM 
A suitable hierarchy is to be decided for attribute ATNUM 

Evaluating Group as the Primary Attribute for Classification 
Total Number of Distinct Classes = 18 












Evaluating Period as the Primary Attribute for Classification 
Total Number of Distinct Classes = 7 








Generalization                                 group8a           period4

BP range                                       [-108 .. -269]    [58 .. 3450]
slope m                                        42                -416
std dev of residuals $\sigma_y$                5%                46%
% intersection with neighboring class x        60%               100%
number of points n                             4                 17
Computed $\alpha$                              0.88              0.8
Computed $\gamma_\alpha$                       0.93              .3


The equation (implication in a functional form) derived by
the best line fit method:


BP = -317 + 41.8*period (valid for group8a).
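
The fitted line can be reproduced from the four known boiling points. The sketch below assumes that the "best line fit method" is ordinary least squares with the period number as the independent variable; that assumption yields the same coefficients as the equation above.

```python
# Known boiling points in group8a, keyed by period: He, Ne, Ar, Xe.
points = {1: -269, 2: -246, 3: -185, 5: -108}

n = len(points)
mean_x = sum(points) / n                 # mean period
mean_y = sum(points.values()) / n        # mean boiling point
sxx = sum((x - mean_x) ** 2 for x in points)
sxy = sum((x - mean_x) * (y - mean_y) for x, y in points.items())

slope = sxy / sxx                        # about 41.8
intercept = mean_y - slope * mean_x      # about -317
bp_kr = intercept + slope * 4            # Kr is in period 4
print(f"BP = {intercept:.0f} + {slope:.1f}*period; BP(Kr) = {bp_kr:.1f}")
```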

The parameters $\alpha$ and $\gamma_\alpha$ are estimated by evaluating
compliance with the criteria for generalizing.


$\alpha = 1 - 0.2 x$

7 a = 0.5 * ( ” W ~ m ) + 0.4 * (1 - 0>) + 0.1 * 

max Homin' \7~l A ) 

Tabulated factors favor generalization in group8a rather than 
in period 4. 


















PARAMETERS FOR THE DERIVED CONCLUSION: 


Derivation using A SPEC without functional dependency 
BP(Kr) = [-108, -269] 

Parameters [$\mu$, $\gamma_\mu$, $\phi$, $\gamma_\phi$] are directly inherited from the parent
node; however, the precision of the answer is low.

The answer is made more precise (narrower range) by using 
functional dependencies discovered in the related elements. 

Derivation using A SPEC together with functional dependency. 

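Assume $\tau$ and $\gamma_\tau$ for Kr within group8a = .9, .95
These can be estimated by evaluating common relevant features
among the siblings.

BP(Kr) = -317 + 41.8*4 = -149.8

$\mu_c = \mu_1 = 1$

$\gamma_{\mu c} = \gamma_{\mu 1} \cdot \alpha \cdot \gamma_\alpha \cdot \tau \cdot \gamma_\tau$

= 1 * .9 * .9 * .9 * .95 = 0.6925

$\phi_c = \phi_1 = 1$

$\gamma_{\phi c} = \gamma_{\phi 1} \cdot \alpha \cdot \gamma_\alpha \cdot \tau \cdot \gamma_\tau$

= 1 * .9 * .9 * .9 * .95 = 0.6925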
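
The confidence arithmetic for the A SPEC derivation is a plain product, which the following sketch checks. The helper name `propagate_confidence` is hypothetical; the factor values are the ones assumed in the derivation above.

```python
import math


def propagate_confidence(gamma, *factors):
    """Scale an inherited confidence by each factor along the inference path."""
    return gamma * math.prod(factors)


alpha, gamma_alpha = .9, .9     # strength of the fitted dependency and its confidence
tau, gamma_tau = .9, .95        # the assumed values for Kr within group8a

gamma_mu_c = propagate_confidence(1, alpha, gamma_alpha, tau, gamma_tau)
gamma_phi_c = propagate_confidence(1, alpha, gamma_alpha, tau, gamma_tau)
print(f"{gamma_mu_c:.5f}")      # 0.69255
```

Because every factor is at most 1, each additional inference step can only lower the confidence of the conclusion.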




Derivation using A SIM transform. 


Find elements similar to Kr in some context which affects 
boiling point. 

Suppose the relevant context is

CX = (.7*group + .3*period)    Rule 1

and the dependency is given by

CX -> boiling point: $\alpha$ = .75, $\gamma_\alpha$ = 1;    Rule 2

• $\alpha$ and $\gamma_\alpha$ are estimated by global multiple regression analysis.

• Localize the search space within the neighborhood of the
argument in question.

• Similarity $\sigma$ and $\gamma_\sigma$ are computed according to the formulas:

$\sigma(\mathrm{Arg}_1, \mathrm{Arg}_2) = \sum_i W_i \cdot \sigma(\mathrm{attr}_i(\mathrm{Arg}_1), \mathrm{attr}_i(\mathrm{Arg}_2))$

$\gamma_\sigma(\mathrm{Arg}_1, \mathrm{Arg}_2) = \sum_i W_i \cdot \gamma_\sigma(\mathrm{attr}_i(\mathrm{Arg}_1), \mathrm{attr}_i(\mathrm{Arg}_2))$

where the weights $W_i$ are normalized such that the sum
of weights is 1.


Assuming pairwise similarity $\sigma$ and $\gamma_\sigma$ values

$\sigma$(gr8a, gr7a) = .2; $\gamma_\sigma$ = .95

$\sigma$(per4, per3) = .8; $\gamma_\sigma$ = .95

$\sigma$(per4, per5) = .7; $\gamma_\sigma$ = .95

and the context given by Rule 1, we get



Elem    Group    Period    $\sigma$(Kr, Elem)          $\gamma_\sigma$(Kr, Elem)

Ar      8a       3         .7*1 + .3*.8 = .94          .95
Xe      8a       5         .7*1 + .3*.7 = .91          .95
Cl      7a       3         .7*.2 + .3*.8 = .38         .95
Br      7a       4         .7*.2 + .3*1 = .44          .95
I       7a       5         .7*.2 + .3*.7 = .35         .95


Disregard elements with $\sigma \cdot \gamma_\sigma$ < threshold T.
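
The similarity column can be recomputed from Rule 1 and the pairwise values. In this sketch an attribute is taken to be fully similar to itself (similarity 1), which matches the table rows; attribute pairs with no stated similarity default to 0, which is an assumption made only so the lookup is total. All identifiers are hypothetical.

```python
WEIGHTS = {"group": .7, "period": .3}   # Rule 1
PAIR_SIM = {                            # pairwise similarity values assumed in the text
    ("8a", "7a"): .2,
    (4, 3): .8,
    (4, 5): .7,
}
ELEMENTS = {                            # element -> (group, period)
    "Kr": ("8a", 4), "Ar": ("8a", 3), "Xe": ("8a", 5),
    "Cl": ("7a", 3), "Br": ("7a", 4), "I": ("7a", 5),
}


def attr_sim(a, b):
    """Similarity of two attribute values; identical values score 1."""
    if a == b:
        return 1.0
    # Unlisted pairs default to 0 (an assumption, not from the source).
    return PAIR_SIM.get((a, b), PAIR_SIM.get((b, a), 0.0))


def similarity(e1, e2):
    """Weighted sum of attribute similarities under the Rule 1 context."""
    g1, p1 = ELEMENTS[e1]
    g2, p2 = ELEMENTS[e2]
    return (WEIGHTS["group"] * attr_sim(g1, g2)
            + WEIGHTS["period"] * attr_sim(p1, p2))


sigma = {e: round(similarity("Kr", e), 2) for e in ELEMENTS if e != "Kr"}
print(sigma)
```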

























Similarity transform reference and parameter estimation: 


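BP(Kr) = BP(Element) : [$\mu$, $\gamma_\mu$, $\phi$, $\gamma_\phi$]

Ar: BP(Ar) = -185 : [$\mu$ = 1, $\gamma_\mu$ = 1, $\phi$ = 1, $\gamma_\phi$ = 1]

BP(Kr) = BP(Ar) = -185, with $\sigma$ = .94

$\mu_c = \mu = 1$

$\gamma_{\mu c} = \gamma_\mu \cdot \sigma \cdot \gamma_\sigma \cdot \alpha \cdot \gamma_\alpha$

= 1 * .94 * .95 * .7 * 1 = .625

$\phi_c = \phi = 1$

$\gamma_{\phi c} = \gamma_\phi \cdot \sigma \cdot \gamma_\sigma \cdot \alpha \cdot \gamma_\alpha$

= 1 * .94 * .95 * .7 * 1 = .625

Similarly,

Xe: BP(Xe) = -108 : [1, 1, 1, 1]

BP(Kr) = BP(Xe) = -108, with $\sigma$ = .91

$\mu_c = 1$

$\gamma_{\mu c}$ = 1 * .91 * .95 * .7 * 1 = .605

$\phi_c = 1$

$\gamma_{\phi c}$ = 1 * .91 * .95 * .7 * 1 = .605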


COMBINATION OF EVIDENCES 


- Take the reference value of the result as the weighted
  average of the BPs, where the weights are decided
  by the $\sigma \cdot \gamma_\sigma$ product.

- The parameters are taken as the weighted average of
  the evidences.

BP(Kr) = [BP(Ar)*$\sigma$(Kr,Ar) + BP(Xe)*$\sigma$(Kr,Xe)] / ($\sum_i \sigma_i$)

BP(Kr) = (-185*.94 + -108*.91)/(.94 + .91) = -147.1

$\mu_c = \sum_i \mu_i / N$ (if $\Delta\mu$ not large)

= (1 + 1)/2 = 1

$\gamma_{\mu c} = \sum_i \gamma_{\mu i} \cdot \sigma \cdot \alpha \cdot \gamma_\alpha / N$

= (1*.94*.75*1 + 1*.91*.75*1)/(1 + 1) = .6175

$\phi_c = \sum_i \phi_i / N$

= (1 + 1)/2 = 1

$\gamma_{\phi c} = \sum_i \gamma_{\phi i} \cdot \sigma \cdot \alpha \cdot \gamma_\alpha / N$

= (1*.94*.75*1 + 1*.91*.75*1)/(1 + 1) = .6175
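
The combined reference value is a similarity-weighted average, which can be checked directly. This sketch covers only the boiling-point estimate, not the combined confidences; the dictionary name `evidence` is hypothetical.

```python
# Evidence for BP(Kr): element -> (boiling point, similarity to Kr).
evidence = {"Ar": (-185, .94), "Xe": (-108, .91)}

total_weight = sum(sim for _, sim in evidence.values())
bp_kr = sum(bp * sim for bp, sim in evidence.values()) / total_weight
print(f"BP(Kr) = {bp_kr:.1f}")   # -147.1
```

Weighting by similarity pulls the estimate slightly toward Ar, the closer neighbor, rather than to the simple midpoint of the two boiling points.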



Comparison of parameters: 


Parameter           A SPEC     A SIM      Actual

BP                  -149.8     -147       -152
$\mu$               1          1          -
$\gamma_\mu$        .6925      .6175      -
$\phi$              1          1          -
$\gamma_\phi$       .6925      .6175      -


Choose the result obtained by A SPEC, since it yields the
inference with the higher confidence.



















CONCLUSION 

Plausible reasoning provides a useful mechanism to manipulate
the available knowledge base to infer conclusions that cannot
be reached by traditional logic.


